109 research outputs found

    Predicting residue contacts using pragmatic correlated mutations method: reducing the false positives

    Get PDF
    BACKGROUND: Predicting residues' contacts using primary amino acid sequence alone is an important task that can guide 3D structure modeling and can verify the quality of the predicted 3D structures. The correlated mutations (CM) method serves as the most promising approach and it has been used to predict amino acids pairs that are distant in the primary sequence but form contacts in the native 3D structure of homologous proteins. RESULTS: Here we report a new implementation of the CM method with an added set of selection rules (filters). The parameters of the algorithm were optimized against fifteen high resolution crystal structures with optimization criterion that maximized the confidentiality of the predictions. The optimization resulted in a true positive ratio (TPR) of 0.08 for the CM without filters and a TPR of 0.14 for the CM with filters. The protocol was further benchmarked against 65 high resolution structures that were not included in the optimization test. The benchmarking resulted in a TPR of 0.07 for the CM without filters and to a TPR of 0.09 for the CM with filters. CONCLUSION: Thus, the inclusion of selection rules resulted to an overall improvement of 30%. In addition, the pair-wise comparison of TPR for each protein without and with filters resulted in an average improvement of 1.7. The methodology was implemented into a web server that is freely available to the public. The purpose of this implementation is to provide the 3D structure predictors with a tool that can help with ranking alternative models by satisfying the largest number of predicted contacts, as well as it can provide a confidence score for contacts in cases where structure is known

    A transient homotypic interaction model for the influenza A virus NS1 protein effector domain

    Get PDF
    Influenza A virus NS1 protein is a multifunctional virulence factor consisting of an RNA binding domain (RBD), a short linker, an effector domain (ED), and a C-terminal 'tail'. Although poorly understood, NS1 multimerization may autoregulate its actions. While RBD dimerization seems functionally conserved, two possible apo ED dimers have been proposed (helix-helix and strand-strand). Here, we analyze all available RBD, ED, and full-length NS1 structures, including four novel crystal structures obtained using EDs from divergent human and avian viruses, as well as two forms of a monomeric ED mutant. The data reveal the helix-helix interface as the only strictly conserved ED homodimeric contact. Furthermore, a mutant NS1 unable to form the helix-helix dimer is compromised in its ability to bind dsRNA efficiently, implying that ED multimerization influences RBD activity. Our bioinformatical work also suggests that the helix-helix interface is variable and transient, thereby allowing two ED monomers to twist relative to one another and possibly separate. In this regard, we found a mAb that recognizes NS1 via a residue completely buried within the ED helix-helix interface, and which may help highlight potential different conformational populations of NS1 (putatively termed 'helix-closed' and 'helix-open') in virus-infected cells. 'Helix-closed' conformations appear to enhance dsRNA binding, and 'helix-open' conformations allow otherwise inaccessible interactions with host factors. Our data support a new model of NS1 regulation in which the RBD remains dimeric throughout infection, while the ED switches between several quaternary states in order to expand its functional space. Such a concept may be applicable to other small multifunctional proteins

    The Transmembrane Domain of CEACAM1-4S Is a Determinant of Anchorage Independent Growth and Tumorigenicity

    Get PDF
    CEACAM1 is a multifunctional Ig-like cell adhesion molecule expressed by epithelial cells in many organs. CEACAM1-4L and CEACAM1-4S, two isoforms produced by differential splicing, are predominant in rat liver. Previous work has shown that downregulation of both isoforms occurs in rat hepatocellular carcinomas. Here, we have isolated an anchorage dependent clone, designated 253T-NT that does not express detectable levels of CEACAM1. Stable transfection of 253-NT cells with a wild type CEACAM1-4S expression vector induced an anchorage independent growth in vitro and a tumorigenic phenotype in vivo. These phenotypes were used as quantifiable end points to examine the functionality of the CEACAM1-4S transmembrane domain. Examination of the CEACAM1 transmembrane domain showed N-terminal GXXXG dimerization sequences and C-terminal tyrosine residues shown in related studies to stabilize transmembrane domain helix-helix interactions. To examine the effects of transmembrane domain mutations, 253-NT cells were transfected with transmembrane domain mutants carrying glycine to leucine or tyrosine to valine substitutions. Results showed that mutation of transmembrane tyrosine residues greatly enhanced growth in vitro and in vivo. Mutation of transmembrane dimerization motifs, in contrast, significantly reduced anchorage independent growth and tumorigenicity. 253-NT cells expressing CEACAM1-4S with both glycine to leucine and tyrosine to valine mutations displayed the growth-enhanced phenotype of tyrosine mutants. The dramatic effect of transmembrane domain mutations constitutes strong evidence that the transmembrane domain is an important determinant of CEACAM1-4S functionality and most likely by other proteins with transmembrane domains containing dimerization sequences and/or C-terminal tyrosine residues

    Islands beneath islands: phylogeography of a groundwater amphipod crustacean in the Balearic archipelago

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Metacrangonyctidae (Amphipoda, Crustacea) is an enigmatic continental subterranean water family of marine origin (thalassoid). One of the species in the genus, <it>Metacrangonyx longipes</it>, is endemic to the Balearic islands of Mallorca and Menorca (W Mediterranean). It has been suggested that the origin and distribution of thalassoid crustaceans could be explained by one of two alternative hypotheses: (1) active colonization of inland freshwater aquifers by a marine ancestor, followed by an adaptative shift; or (2) passive colonization by stranding of ancestral marine populations in coastal aquifers during marine regressions. A comparison of phylogenies, phylogeographic patterns and age estimations of clades should discriminate in favour of one of these two proposals.</p> <p>Results</p> <p>Phylogenetic relationships within <it>M. longipes </it>based on three mitochondrial DNA (mtDNA) and one nuclear marker revealed five genetically divergent and geographically structured clades. Analyses of cytochrome oxidase subunit 1 (<it>cox1</it>) mtDNA data showed the occurrence of a high geographic population subdivision in both islands, with current gene flow occurring exclusively between sites located in close proximity. Molecular-clock estimations dated the origin of <it>M. longipes </it>previous to about 6 Ma, whereas major cladogenetic events within the species took place between 4.2 and 2.0 Ma.</p> <p>Conclusions</p> <p><it>M. longipes </it>displayed a surprisingly old and highly fragmented population structure, with major episodes of cladogenesis within the species roughly correlating with some of the major marine transgression-regression episodes that affected the region during the last 6 Ma. Eustatic changes (vicariant events) -not active range expansion of marine littoral ancestors colonizing desalinated habitats-explain the phylogeographic pattern observed in <it>M. longipes</it>.</p

    Structure and Stability of the Spinach Aquaporin SoPIP2;1 in Detergent Micelles and Lipid Membranes

    Get PDF
    Background: SoPIP2;1 constitutes one of the major integral proteins in spinach leaf plasma membranes and belongs to the aquaporin family. SoPIP2;1 is a highly permeable and selective water channel that has been successfully overexpressed and purified with high yields. In order to optimize reconstitution of the purified protein into biomimetic systems, we have here for the first time characterized the structural stability of SoPIP2;1. Methodology/Principal Finding: We have characterized the protein structural stability after purification and after reconstitution into detergent micelles and proteoliposomes using circular dichroism and fluorescence spectroscopy techniques. The structure of SoPIP2;1 was analyzed either with the protein solubilized with octyl-beta-D-glucopyranoside (OG) or reconstituted into lipid membranes formed by E. coli lipids, diphytanoylphosphatidylcholine (DPhPC), or reconstituted into lipid membranes formed from mixtures of 1-palmitoyl-2-oleoyl-phosphatidylcholine (POPE), 1-palmitoyl-2-oleoyl-phosphatidylethanolamine (POPE), 1-palmitoyl-2-oleoyl-phosphatidylserine (POPS), and ergosterol. Generally, SoPIP2;1 secondary structure was found to be predominantly a-helical in accordance with crystallographic data. The protein has a high thermal structural stability in detergent solutions, with an irreversible thermal unfolding occurring at a melting temperature of 58 degrees C. Incorporation of the protein into lipid membranes increases the structural stability as evidenced by an increased melting temperature of up to 70 degrees C. Conclusion/Significance: The results of this study provide insights into SoPIP2;1 stability in various host membranes and suggest suitable choices of detergent and lipid composition for reconstitution of SoPIP2;1 into biomimetic membranes for biotechnological applications

    Inference of Co-Evolving Site Pairs: an Excellent Predictor of Contact Residue Pairs in Protein 3D structures

    Get PDF
    Residue-residue interactions that fold a protein into a unique three-dimensional structure and make it play a specific function impose structural and functional constraints on each residue site. Selective constraints on residue sites are recorded in amino acid orders in homologous sequences and also in the evolutionary trace of amino acid substitutions. A challenge is to extract direct dependences between residue sites by removing indirect dependences through other residues within a protein or even through other molecules. Recent attempts of disentangling direct from indirect dependences of amino acid types between residue positions in multiple sequence alignments have revealed that the strength of inferred residue pair couplings is an excellent predictor of residue-residue proximity in folded structures. Here, we report an alternative attempt of inferring co-evolving site pairs from concurrent and compensatory substitutions between sites in each branch of a phylogenetic tree. First, branch lengths of a phylogenetic tree inferred by the neighbor-joining method are optimized as well as other parameters by maximizing a likelihood of the tree in a mechanistic codon substitution model. Mean changes of quantities, which are characteristic of concurrent and compensatory substitutions, accompanied by substitutions at each site in each branch of the tree are estimated with the likelihood of each substitution. Partial correlation coefficients of the characteristic changes along branches between sites are calculated and used to rank co-evolving site pairs. Accuracy of contact prediction based on the present co-evolution score is comparable to that achieved by a maximum entropy model of protein sequences for 15 protein families taken from the Pfam release 26.0. Besides, this excellent accuracy indicates that compensatory substitutions are significant in protein evolution.Comment: 17 pages, 4 figures, and 4 tables with supplementary information of 5 figure

    Correlated Mutations: A Hallmark of Phenotypic Amino Acid Substitutions

    Get PDF
    Point mutations resulting in the substitution of a single amino acid can cause severe functional consequences, but can also be completely harmless. Understanding what determines the phenotypical impact is important both for planning targeted mutation experiments in the laboratory and for analyzing naturally occurring mutations found in patients. Common wisdom suggests using the extent of evolutionary conservation of a residue or a sequence motif as an indicator of its functional importance and thus vulnerability in case of mutation. In this work, we put forward the hypothesis that in addition to conservation, co-evolution of residues in a protein influences the likelihood of a residue to be functionally important and thus associated with disease. While the basic idea of a relation between co-evolution and functional sites has been explored before, we have conducted the first systematic and comprehensive analysis of point mutations causing disease in humans with respect to correlated mutations. We included 14,211 distinct positions with known disease-causing point mutations in 1,153 human proteins in our analysis. Our data show that (1) correlated positions are significantly more likely to be disease-associated than expected by chance, and that (2) this signal cannot be explained by conservation patterns of individual sequence positions. Although correlated residues have primarily been used to predict contact sites, our data are in agreement with previous observations that (3) many such correlations do not relate to physical contacts between amino acid residues. Access to our analysis results are provided at http://webclu.bio.wzw.tum.de/~pagel/supplements/correlated-positions/

    Interrogating and Predicting Tolerated Sequence Diversity in Protein Folds: Application to E. elaterium Trypsin Inhibitor-II Cystine-Knot Miniprotein

    Get PDF
    Cystine-knot miniproteins (knottins) are promising molecular scaffolds for protein engineering applications. Members of the knottin family have multiple loops capable of displaying conformationally constrained polypeptides for molecular recognition. While previous studies have illustrated the potential of engineering knottins with modified loop sequences, a thorough exploration into the tolerated loop lengths and sequence space of a knottin scaffold has not been performed. In this work, we used the Ecballium elaterium trypsin inhibitor II (EETI) as a model member of the knottin family and constructed libraries of EETI loop-substituted variants with diversity in both amino acid sequence and loop length. Using yeast surface display, we isolated properly folded EETI loop-substituted clones and applied sequence analysis tools to assess the tolerated diversity of both amino acid sequence and loop length. In addition, we used covariance analysis to study the relationships between individual positions in the substituted loops, based on the expectation that correlated amino acid substitutions will occur between interacting residue pairs. We then used the results of our sequence and covariance analyses to successfully predict loop sequences that facilitated proper folding of the knottin when substituted into EETI loop 3. The sequence trends we observed in properly folded EETI loop-substituted clones will be useful for guiding future protein engineering efforts with this knottin scaffold. Furthermore, our findings demonstrate that the combination of directed evolution with sequence and covariance analyses can be a powerful tool for rational protein engineering

    Four Distances between Pairs of Amino Acids Provide a Precise Description of their Interaction

    Get PDF
    The three-dimensional structures of proteins are stabilized by the interactions between amino acid residues. Here we report a method where four distances are calculated between any two side chains to provide an exact spatial definition of their bonds. The data were binned into a four-dimensional grid and compared to a random model, from which the preference for specific four-distances was calculated. A clear relation between the quality of the experimental data and the tightness of the distance distribution was observed, with crystal structure data providing far tighter distance distributions than NMR data. Since the four-distance data have higher information content than classical bond descriptions, we were able to identify many unique inter-residue features not found previously in proteins. For example, we found that the side chains of Arg, Glu, Val and Leu are not symmetrical in respect to the interactions of their head groups. The described method may be developed into a function, which computationally models accurately protein structures

    Mutations in or near the Transmembrane Domain Alter PMEL Amyloid Formation from Functional to Pathogenic

    Get PDF
    PMEL is a pigment cell-specific protein that forms physiological amyloid fibrils upon which melanins ultimately deposit in the lumen of the pigment organelle, the melanosome. Whereas hypomorphic PMEL mutations in several species result in a mild pigment dilution that is inherited in a recessive manner, PMEL alleles found in the Dominant white (DW) chicken and Silver horse (HoSi)β€”which bear mutations that alter the PMEL transmembrane domain (TMD) and that are thus outside the amyloid coreβ€”are associated with a striking loss of pigmentation that is inherited in a dominant fashion. Here we show that the DW and HoSi mutations alter PMEL TMD oligomerization and/or association with membranes, with consequent formation of aberrantly packed fibrils. The aberrant fibrils are associated with a loss of pigmentation in cultured melanocytes, suggesting that they inhibit melanin production and/or melanosome integrity. A secondary mutation in the Smoky chicken, which reverts the dominant DW phenotype, prevents the accumulation of PMEL in fibrillogenic compartments and thus averts DW–associated pigment loss; a secondary mutation found in the Dun chicken likely dampens a HoSi–like dominant mutation in a similar manner. We propose that the DW and HoSi mutations alter the normally benign amyloid to a pathogenic form that antagonizes melanosome function, and that the secondary mutations found in the Smoky and Dun chickens revert or dampen pathogenicity by functioning as null alleles, thus preventing the formation of aberrant fibrils. We speculate that PMEL mutations can model the conversion between physiological and pathological amyloid
    • …
    corecore